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Abstract. We use the viewpoint of the formal calculus underlying vertex operator al- 
gebra theory to study certain aspects of the classical umbral calculus. We begin by 
calculating the exponential generating function of the higher derivatives of a composite 
function, following a very short proof which naturally arose as a motivating computa- 
tion related to a certain crucial "associativity" property of an important class of vertex 
operator algebras. Very similar (somewhat forgotten) proofs had appeared by the 19-th 
century, of course without any motivation related to vertex operator algebras. Using this 
formula, we derive certain results, including especially the calculation of certain adjoint 
operators, of the classical umbral calculus. This is, roughly speaking, a reversal of the 
logical development of some standard treatments, which have obtained formulas for the 
higher derivatives of a composite function, most notably Faa di Bruno's formula, as a 
consequence of umbral calculus. We also show a connection between the Virasoro algebra 
and the classical umbral shifts. This leads naturally to a more general class of operators, 
which we introduce, and which include the classical umbral shifts as a special case. We 
prove a few basic facts about these operators. 



1. Introduction 

We present from first principles certain aspects of the classical umbral calculus, con- 
cluding with a connection to the Virasoro algebra. One of our main purposes is to show 
connections between the classical umbral calculus and certain central considerations in 
vertex operator algebra theory. The first major connection is an analogue, noted in |FLMj . 
of those authors' original argument showing that lattice vertex operators satisfy a certain 
fundamental associativity property. Those authors observed that this analogue amounts 
to a simple calculation of the higher derivatives of a composite function, often formulated 
as Faa di Bruno's formula. The philosophy of vertex operator algebra theory led those 
authors to emphasize the exponential generating function of the higher derivatives rather 
than the coefficients (which are easily extracted). That generating function was the ana- 
logue of a certain vertex operator. We shall show how taking this as a starting point, one 
may easily (and rigorously) recover significant portions of the classical umbral calculus of 
Sheffer sequences. The main aim, part of ongoing research, is to further develop the anal- 
ogy between vertex operator algebra theory and classical umbral calculus. In addition, 
a direct connection between the classical umbral shifts and the Virasoro algebra (which 
plays a central role in vertex operator algebra theory) is established in the second half of 
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this paper. Further analogies between vertex algebra formulas and classical umbral calcu- 
lus formulas are noted in connection with this result and these motivate a generalization 
of the classical umbral shifts, which we briefly develop at the conclusion of this paper. 

The classical umbral calculus has been treated rigorously in many works following the 
pioneering research of Gian-Carlo Rota, such as e.g. |MR] . |RKO] . |Gaj . |Rtj . |RR] . 
|Rmlj . [Fr] , [T] and |Chj . For an extensive bibliography through 2000 we refer the reader 
to |BLj . The general principle of umbral techniques reaches far beyond the classical 
umbral calculus and continues to be a subject of research (see e.g. |DSj . [N] and [Z2J ) . 
Our treatment involves only certain portions of the classical umbral calculus of Sheffer 
sequences as developed in |Rmlj . 

There are many proofs of Faa di Bruno's formula for the higher derivatives of a com- 
posite function as well as related formulas dating back to at least the early 19th century 
(see [Jo] for a brief history, as well as [X], [B], (BE], [FT], [Fj, [Lu], [Hi], and [Sc]). 
Moreover, it is a result that seems basic enough to be prone to showing up in numerous 
unexpected places, such as in connection with vertex operator algebra theory and also, 
as I recently learned from Professor Robert Wilson, in the theory of divided power alge- 
bras, to give just one more example. Here, for instance, a special case of Faa di Bruno's 
formula implies that certain coefficients are combinatorial and therefore integral, which 
is the point of interest since one wants a certain construction to work over fields of finite 
characteristic (see e.g. Lemma 1.3 of [Wi] ) . Faa di Bruno's formula is purely algebraic or 
combinatorial. For a couple of combinatorial proofs we refer the reader to |Z1] and [Chj . 
however we shall only be concerned with algebraic aspects of the result in this paper. 

Our interest in Faa di Bruno's formula is due to its appearance in two completely 
separate subjects. First, it has long well-known connections with umbral calculus and 
second, perhaps more subtly, it shows up in the theory of vertex operator algebras. There 
are several umbral style proofs of Faa di Bruno's formula. According to [Jo], an early one 
of these is due to Riordan |Ril] using an argument later completely rigorized in |Rm2] and 
|Chj . Perhaps even more important, though, is the point of view taken in Section 4.1.8 of 
|Rmlj . where the author discusses what he calls the "generic associated sequence," which 
he relates to the Bell polynomials, which themselves are closely related to Faa di Bruno's 
formula. The first part of this paper may, very roughly, be regarded as showing a way to 
develop some of the classical umbral calculus beginning from such "generic" sequences. 
We also bring attention more fully to |Ch] in which the formalism of "grammars" and 
some of the techniques quite closely resemble our approach at this stage, as I recently 
became aware. 

Faa di Bruno's formula (in generating function form) appears in the theory of vertex 
operator algebras as originally observed in |FLM] . Briefly, Faa di Bruno's formula ap- 
peared in generating function form as an analogue, noted in |FLM] . of those authors' 
original argument showing that lattice vertex operators satisfy a certain fundamental as- 
sociativity property. The work |FLM] deals with many topics, but the parts which are of 
interest to us have to do with vertex operator algebra theory as well as, in particular, the 
Virasoro algebra, which is a very important ingredient in vertex operator algebra theory. 
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We note that although certain crucial material from the theory of vertex operator alge- 
bras plays an essential role in the motivation of this paper, it turns out that we do not 
need explicit material directly about vertex operator algebras for the present work. By 
way of the literature, we briefly mention that the mathematical notion of vertex algebras 
was introduced in [B] and the variant notion of vertex operator algebra was introduced 
in jFLMj. An axiomatic treatment of vertex operator algebras was given in [FHL] and a 
more recent treatment was presented in |LL] . The interested reader may consult |L2] for 
an exposition of the history of the area. 

This work began, unexpectedly, with certain considerations of the formal calculus devel- 
oped to handle some of the algebraic, and ultimately, analytic aspects of vertex operator 
algebra theory. Those considerations were related to elementary results in the logarithmic 
formal calculus as developed in |Mi] and [HLZJ. However, we shall not discuss the con- 
nection to the logarithmic formal calculus here (for this see |R1] and |R2j ) since another 
more classical result stemming from vertex algebra theory turns out to be more central 
to this material, namely that calculation which amounted to a calculation of the higher 
derivatives of a composite function, which was mentioned above. For the details of this 
calculation, see the introduction to Chapter 8 as well as Sections 8.3 and 8.4 of |FLM] 
and in particular Proposition 8.3.4, formula (8.4.32) and the comment following it. 

The Virasoro algebra was studied in the characteristic case in |GFj and the character- 
istic p analogue was introduced by R. Block in [B1J . Over C it may be realized as a central 
extension of the complexified Lie algebra of polynomial vector fields on the circle, which 
is itself called the Witt algebra. A certain crucial operator representation was introduced 
by Virasoro in [V] with unpublished contributions made by J.H. Weis, and the operators 
of this representation play a well known and essential role in string theory and vertex 
operator algebra theory (cf. |FLMj ). Our connection with umbral calculus is made via 
one of these operators. 

Since this paper is interdisciplinary, relating ideas in vertex operator algebra theory 
and umbral calculus, we have made certain choices regarding terminology and exposition 
in an effort to make it more accessible to readers who are not specialists in both of these 
fields. Out of convenience we have chosen |Rml] as a reference for standard well-known 
results of umbral calculus. A well known feature of umbral calculus is that it is amenable 
to many different recastings. For instance, as the referee has pointed out, many of the 
main classical results, recovered from our point of view in Section 4, concerning adjoint 
relationships also appeared in jFfj> where what Roman |Rmlj refers to as "adjoints" are 
very nicely handled by a certain type of "transform." The change in point of view, 
among other things, gives a very interesting alternative perspective on the results and we 
encourage the interested reader to compare the treatments. However, in the interests of 
space, when we wish to show the equivalence of certain of our results with the literature 
we will restrict ourselves to using the notation and framework in |Rmlj . 

In this paper we attempt to avoid specialized vocabulary as much as possible, although 
we shall try to indicate in remarks at least some of the important vocabulary from clas- 
sical works. We shall use the name umbral calculus or classical umbral calculus since 
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this seems to enjoy widespread name-recognition, but as the referee pointed out "finite 
operator calculus" might be a more appropriate name for much of the material such as 
the method in Proposition 13.11 and relevant material beginning in Section 4 of this work. 
We have also attempted to keep specialized notation to a minimum. However, because 
the notation which seems natural to begin with differs from that used in |Rml] we do 
include calculations bridging the notational gap in Section 4 for the convenience of the 
reader. We note that the proofs of the results in Section 4 are much more roundabout 
than necessary if indeed those results in and of themselves were what was sought. The 
point is to show that from natural considerations based on the generating function of 
the higher derivatives of a composite function, one does indeed recover certain results of 
classical umbral calculus. 

We shall now outline the present work section-by-section. In Section |2j along with 
some basic preliminary material, we begin by presenting a special case of the concise 
calculation of the exponential generating function of the higher derivatives of a composite 
function which appeared in the proof of Proposition 8.3.4 in |FLM] . Using this as our 
starting point, in Section |3]we then abstract this calculation and use the resulting abstract 
version to derive various results of the classical umbral calculus related to what Roman 
|Rmlj called associated Sheffer sequences. The umbral results we derive in this section 
essentially calculate certain adjoint operators, though in a somewhat disguised form. In 
Section HI we then translate these "disguised" results into more familiar language using 
essentially the formalism of |Rml] . We shall also note in this section how umbral shifts 
are defined as those operators satisfying what may be regarded as an umbral analogue of 
the L(— l)-bracket-derivative property (cf. formula (8.7.30) in |FLM] ). The observation 
that such analogues might be playing a role was suggested by Professor James Lepowsky 
after looking at a preliminary version of this paper. 

In Section we make an observation about umbral shifts which will be useful in the 
last phase of the paper. 

In Section we begin the final phase of this paper, in which we relate the classical um- 
bral calculus to the Virasoro algebra of central charge 1. Here we recall the definition of 
the Virasoro algebra along with one special case of a standard "quadratic" representation; 
cf. Section 1.9 of |FLM] for an exposition of this well-known quadratic representation. We 
then show how an operator which was central to our development of the classical umbral 
calculus is precisely the L(—l) operator of this particular representation of the Virasoro 
algebra of central charge 1. Using a result which we obtain in Section 13 we show a rela- 
tionship between the classical umbral shifts and the operator now identified as L{— 1) and 
we then introduce those operators which in a parallel sense correspond to L(n) for n > 0. 
(Strictly speaking, by focusing on only those operators L(n) with n > —1, which them- 
selves span a Lie algebra, the full Virasoro algebra along with its central extension remain 
effectively invisible.) We conclude by showing a couple of characterizations of these new 
operators in parallel to characterizations we already had of the umbral shifts. In partic- 
ular we also note how the second of these characterizations, formulated as Proposition 
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17.21 may be regarded as an umbral analogue of (8.7.37) in |FLM] . extending an analogue 
already noted concerning the L(— l)-bracket-derivative property. 

We note also that Bernoulli polynomials have long had connections to umbral calcu- 
lus (see e.g. |Mel] ) and have recently appeared in vertex algebra theory (see e.g. [LI] 
and [DLM] ). It might be interesting to investigate further connections between the two 
subjects that involve Bernoulli polynomials explicitly. 

This paper is an abbreviated version of part of [R3] (cf. also [R4] ) . The additional 
material in the longer versions is largely expository, for the convenience of readers who 
are not specialists. 

I wish to thank my advisor, Professor James Lepowsky, as well as the attendees (regular 
and irregular) of the Lie Groups/Quantum Mathematics Seminar at Rutgers University 
for all of their helpful comments concerning certain portions of the material which I 
presented to them there. I also want to thank Professors Louis Shapiro, Robert Wilson 
and Doron Zeilberger for their useful remarks. Additionally, I would like to thank the 
referee for many helpful comments. 

Finally, I am grateful for partial support from NSF grant PHY0901237. 



2. Preliminaries 

We set up some notation and recall some well-known and easy preliminary propositions 
in this section. For a more complete treatment, we refer the reader to the first three 
sections of Chapter 8 of [FLMJ (cf. Chapter 2 of |LLj ). while noting that in this paper 
we shall not need any of the material on "expansions of zero," the heart of the formal 
calculus treated in those works. 

We shall write t, u, v, w, x, y, z, x n , y m , z n for commuting formal variables, where n > 
and m G Z. All vector spaces will be over C. Let V be a vector space. We use the 
following: 

(formal power series), and 

C[x] = < c n x n \c n G C, c n = for all but finitely many n > 

(formal polynomials). 

We denote by ^ the formal derivative acting on either C[x) or C[[x]]. Further, we shall 
frequently use the notation e D to refer to the formal exponential expansion, where □ is 
any formal object for which such expansion makes sense. By "makes sense" we mean 
that the coefficients of the monomials of the expansion are finite objects. For instance, 
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we have the linear operator e w ^ : C[[x, x *]] — > C[[x,x l ])[[w]]: 

d \ w n ( d 
n\ V dx 



n>0 

We recall that a linear map D on an algebra A which satisfies 

D(ab) = (Da)b + a(Db) for all a,b e A 

is called a derivation. Of course, the linear operator when acting on either C[x] or 
C[[x]] is an example of a derivation. 

It is a simple matter to verify, by induction for instance, the following version of the 
elementary binomial theorem. Let A be an algebra with derivation D. Then for all a, 
b G A, we have: 

(2.1) D«a b= J2(k + iy.^^ 

k+l=n 

(2.2) e wD ab = (e wD a) (e wD b) . (the automorphism property) 

Further, we separately state the following important special case of the automorphism 
property. For f(x),g(x) G C[[x]], 



d 



The automorphism property shows, among other things, how the operator e w dx may 
be regarded as a formal substitution, since, for n > 0, we have: 



e w ^ x n = (e w ^x) = (x + w) n 



Therefore, by linearity, we get the following polynomial formal Taylor formula. For p(x) G 

CM, 

e w d^p(x) = p(x + w). 

Since the total degree of every term in (x + w) n is n, we see that e w ^ preserves total 
degree. By equating terms with the same total degree we can therefore extend the previous 
proposition to get the following. For f(x) G C[[x]], 

(2.3) e w ^f(x) = f(x + w). 

Remark 2.1. We note that the identity ( 12.31) can be derived immediately by direct 
expansion as the reader may easily check. However, in the formal calculus used in vertex 
operator algebra theory it is often better to think of this minor result within a context 
like that provided above. For instance, it is often useful to regard such formal Taylor 
theorems concerning formal translation operators as representations of the automorphism 
property (see [Rlj . |R2] and Remark [2. 2p . 
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We have calculated the higher derivatives of a product of two polynomials using the 
automorphism property. We next reproduce (in a very special case, for the derivation J^), 
the quick argument given in Proposition 8.3.4 of |FLM] to calculate the higher derivatives 
of the composition of two formal power series. Let f(x),g(x) G C[[x]]. We further require 
that g(x) have zero constant term, so that, for instance, the composition f(g(x)) is always 
well defined. We shall approach the problem by calculating the exponential generating 
function of the higher derivatives of f(g(x)). We get 

e^f(g(x)) = f(g(x + w)) 

= f(g(x) + (g(x + w)- g(x))) 

e (ff(*+io)-ff(*))£ f( z 

fW(g(x)) 



E 



n>0 



z=g(x) 

e" J ^g{x) -g(x) 



( , 4) =E /^( E £^ 

n>0 \m>l 

While our calculation of the higher derivatives is not, strictly speaking, complete at this 
stage (although all that remains is a little work to extract the coefficients in powers of 
w), it is in fact this formula which will be of importance to us, since, roughly speaking, 
many results of the classical umbral calculus follow because of it, and so we shall record 
it as a proposition. 

Proposition 2.1. Let f(x) and g(x) G C[[x]]. Let g{x) have zero constant term. Then 
we have 

(2.5) rfM^Mfj^ 



n\ \ *■ — ' m\ 

n>0 \m>l 



□ 



A derivation of Faa di Bruno's classical formula may be found in Section 12.3 of |Anj . 
We shall not need the fully expanded formula. 

Remark 2.2. The more general version of this calculation (based on a use of the auto- 
morphism property instead of the formal Taylor theorem) appeared in [FLM] because it 
was related to a much more subtle and elaborate argument showing that vertex operators 
associated to lattices satisfied a certain associativity property (see |FLMj . Sections 8.3 and 
8.4 and in particular, formula (8.4.32) and the comment following it). The connection is 
due in part to the rough resemblance between the exponential generating function of the 
higher derivatives of a composite function in the special case f(x) = e x (see (\2.6\i below) 
and "half of" a vertex operator. 
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Noting that in (12. 4 j) we treated g(x + w) — g(x) as one atomic object suggests a reor- 
ganization. Indeed by calling g(x + w) — g(x) = v and g(x) = u, the second, third and 
fourth lines of (12.41) become 



- ( n ) 

/(«+ W ) = c ^/(„)=5] ' 



ni 

n>0 



This is just the formal Taylor theorem, of course, and so we could have begun here and 
then re-substituted for u and v to get the result. This, according to [Jo], is how the proof 
of U. Meyer |Mej runs. 

It is also interesting to specialize to the case where f(x) = e x , as is often done, and 
indeed was the case which interested the authors of |FLM] and will interest us in later 
sections. We have simply 



(2.6) e w d^ e 9( x ) — e 9( x + w ) — e 9( x ) e 9(x+w)-g(x) _ gt/OOg - 1 S! 

Remark 2.3. The generating function for what are called the Bell polynomials (cf. Chap- 
ter 12.3 and in particular (12.3.6) in |Anj ) easily follows from (I2.6P using a sort of slightly 
unrigorous old-fashioned umbral argument replacing g^ with g m . See the proof of Propo- 
sition 13.11 for one way of handling such arguments. (The referee has pointed out that one 
may also rigorize this argument with certain evaluations of umbral elements in the umbral 
calculus whereas our argument in Proposition 13.11 is closer to the related finite operator 
calculus.) Of course, if we also set g(x) = e x — 1, we get e w ~^e eX ~ l = e eX w ~ l and setting 
x = is easily seen to give the well-known result that e e ™ -1 is the generating function of 
the Bell numbers, which are themselves the Bell polynomials with all variables evaluated 
at 1. 

For convenience we shall globally name three generic (up to the indicated restrictions) 
elements of C [[£]]: 

j.n j.n -in 

(2.7) A{t) = Y^A n - B(t) = Y / B n -. and C(f) = VC„-, 

^— ' n! n! ^-^ n! 

n>0 n>l n>0 

where both Bi ^ and Co ^ (and note the ranges of summation). We recall, and it 
is easy for the reader to check, that B(t) has a compositional inverse, which we denote 
by B(t), and that C(t) has a multiplicative inverse, C{t)~ l . We note further that since 
B(t) has zero constant term, B'(B(t)) is well defined, and we shall denote it by B*(t). In 
addition, p(x) will always be a formal polynomial and sometimes we shall feel free to use 
a different variable such as z in the argument of one of our generic series, so that A(z) is 
the same type of series as A(t), only with the name of the variable changed. 



Remark 2.4. We defined B*(t) = B'(B(t)). As the referee pointed out, it is also true 
that B*(t) = =7 

of B(B(t)) = t. 



that B*{t) = ^^y ; which follows from the chain rule by taking the derivative of both sides 



Remark 2.5. Series of the form B(t) are sometimes called "delta series" in umbral 
calculus, or finite operator calculus (cf. |Rmlj ). 

We shall also use the notation A^ n '(t) for the derivatives of, in this case, A(t), and it 
will be convenient to define this for all n G Z to include anti-derivatives. Of course, to 
make that well-defined we need to choose particular integration constants and only one 
choice is useful for us, as it turns out. 

Notation 2.1. For all n G Z, given a fixed sequence A m G C for all m G Z, we shall 
define 

A j-m—n 



.m — ny. 

m>n 

3. A RESTATEMENT OF THE PROBLEM AND FURTHER DEVELOPMENTS 

In the last section we considered the problem of calculating the higher formal derivatives 
of a composite function of two formal power series, f(g(x)), where we obtained an answer 
involving only expressions of the form f ( - n \g(x)) and Because of the restricted 

form of the answer it is convenient to translate the result into a more abstract notation 
which retains only those properties needed for arriving at Proposition 12.11 This essential 
structure depends only on the observation that 4:f( n '{g(x)) = f^ n+1 Kg(x))(g^(x)) for 
n > and that £g (m) (x) = g^ m+1 \x) for m > 1. 

Motivated by the above paragraph, we consider the algebra 

C[. . . , y-2, y-uyo, yi,...,x 1 ,x 2 ,...}. 

Then let D be the unique derivation on C[. . . , y^2, V-i, Vo, Vi, 1/2, • • • ,X\,x 2 , ■ ■ ■} satisfying 

Dyi = y i+1 X! i G Z 
Dxj = xj+i j > I. 

Then the question of calculating e w ~^ f(g(x)) as in the last section is seen to be essentially 
equivalent to calculating 

e wD y , 

where we "secretly" identify D with J^, f^ n \g(x)) with y n and with x m . We shall 

make this identification rigorous in the proof of the following proposition, while noting 
that the statement of said following proposition is already (unrigorously) clear, by the 
"secret" identification in conjunction with Proposition 12.11 

Proposition 3.1. We have 



(3.1) e wD y = J2 



n>0 



10 THOMAS J. ROBINSON 

Proof. Let f(x),g(x) G C[[x]] such that g(x) has zero constant term as in Proposition 12. II 
Consider the unique algebra homomorphism 

<f> ft g : C[. . . , y- 2 , y-i, yo, yi, ■ ■ ■ , x h x 2 , ■ ■ ■ ] -> C[[x]] 

satisfying 

<Pf, g Vi = f {i) {g{x)) »eZ and 

Then we claim that we have 



nXi = g (i) (x) i>\. 



<t>f,9 oD = ~^° far 

Since is a homomorphism and D is a derivation, it is clear that we need only check 
that these operators agree when acting on yi for isZ and Xj for j > 1. We get 

(0 /i9 o D)yi = ^(wh-xxO = = ^/%(*)) = ° W 

and 

° ar< = = 3 (m) (*) = ^ W (^) = ° 0/,<?J ^> 

which gives us the claim. Then, using the obvious extension of by ( 12. 5 p we have 

(3.2) />9 = e^to = /(«,(*)) = />9 £ ^ ^ - ! j , 

for all /(x) and g(x). 

Next take the formal limit as x — > of the first and last terms of (13. 2p . These identities 
clearly show that we get identities when we substitute /^(O) for y n and g^(0) for x n 
in ( I3.ip . But /^(0) and <7^(0) are arbitrary and since ( 13. ip amounts to a sequence of 
multinomial polynomial identities when equating the coefficients of w n , we are done. □ 

We observe that it would have been convenient in the previous proof if the maps 
had been invertible. We provide a second proof of Proposition 13.11 using such a set-up. 
This proof is closely based on a proof appearing in [Ch] . We hope the reader won't mind 
a little repetition. 

Proof, (second proof of Proposition 13. 1 p 

Let F(x) = ^2 n>0 an d G(x) = J2 n>1 ^f-. Consider the unique algebra homomor- 
phism 

ijj : C[. . .,y-2,y-i,Vo,Vi, ■ ■ ■ ,X!,x 2 , ■ ■ ■] C[. . •,y-2,y-i,Vo,Vi, ■ ■ -,x u x 2 , . ■ -}[[x]] 
satisfying 

ij;(y i ) = F®(G(x)) ieZ and 
^( Xi ) = G^(x) i>l. 
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Then we claim that we have 



d , 
ip o D = — o ip. 
ax 



Since ip is a homomorphism and D is a derivation, it is clear that we need only check that 
these operators agree when acting on yi for « G Z and Xj for j > 1. We get 



(V o D) W = = F( 4+1 )(G(a;))G'(x) = ±-F^{G{x)) = (J- o 



Vi 



and 

T ■ 



(ip o D)xi = ip(x i+1 ) = G« +1 \x) = -j^G®(x) = (J- o V 

which gives us the claim. Then, using the obvious extension of ^, we have 

(3.3) ip (e^yo) = e^Vfoo) = e w ^F(G(x)). 

But now we get to note that if) has a left inverse, namely setting x = 0, because 
F«(G(0)) = ?/j for i G Z and G^(0) = x< for i > 1. Thus we get 

(3.4) e wD y = (e<*F{G{x))) \ x=0 = F(G(x + w))\ x=0 = F(G(w)), 

which is exactly what we want. □ 

We note that our second proof of Proposition 13.11 did not depend on Proposition 12.11 
Completing a natural circle of reasoning, by using the first proof of Proposition 13. 1[ before 
invoking Proposition 12. 1\ we had from (13. 2p 

<Pf, 9 {e wD yo) = e w ^<p f , g y = e"*f(g(x)), 

which by (13.41) gives 

which gives us back Proposition 12.11 Thus we have shown in a natural way how Proposi- 
tions 12.11 and 13.11 are equivalent. 



Remark 3.1. One nice aspect of our second proof of Proposition 13.11 based closely on a 
proof in [Ch] , is that its key brings to the fore of the argument perhaps the most striking 
feature of the result, which is that the exponential generating function of higher derivatives 
of a composite function is itself in the form of a composite function. This, of course, is an 
old-fashioned umbral feature. Furthermore, it was the form of the answer, that it roughly 
resembled "half of a vertex operator," which was what interested the authors of |FLMj . 
This feature is also central to what follows. 

We may now clearly state the trick on which (from our point of view) much of the 
classical umbral calculus is based. It is clear that if we substitute A n for y n and B n 
for x n in (13. ip then the right-hand side will become A(B(w)). Actually, it will be more 
interesting to substitute xB n for x n . With this as motivation, we formally define two 
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(for flexibility) substitution maps. Let XB(t) an d i>A(t) be the algebra homomorphisms 
uniquely defined by the following: 

XB(t) : C[. . . , y , yx, . . . , x x , x 2 , • • ■ ] ->• C[. . . , y , J/i, • • • , x] 

with 

XB(t){yi)=Vi »e2 
XB(t)(a!j) = ^ i > !■ 



and 



with 



ip A{t) : C[. . . , 2/_i, 2/0, 2/1, . . . , a;] — ► C[x] 



V'A(t)(y*) = ^ *gz 

lf>A{t)(x) = X. 

Then we have 

(3.5) ° XB(t) (e" D 2/o) = A(xB(w)). 

To keep the notation from becoming cluttered, we shall sometimes abbreviate A(t) by 
simply A and make other similar abbreviations when there should be no confusion. 

We next note that it is not difficult to explicitly calculate the action of ipA ° Xb ° & wD 
on C[. . . , yo, y±, . . . , xi, x 2 , ■ ■ ■ }■ Indeed it is easy to see that we have 

(3.6) ip A °XBoe wD y n = A^ n \xB(w)) neZ 

(3.7) and i) A o Xb ° e wD x n = xB {n \w) n > I. 

These identities determine the action completely because of the automorphism property 
satisfied by e wD . 

The following series of identities (one of which is (13. 5p repeated) is immediate from 
what we have shown: 



(3.8) 


''Pa ° Xb ° e wD yi 


= A'(xB(w)) 


(3.9) 


ipA> °Xb° e wD y 


= A'(xB(w)) 


(3.10) 


d 

o- °^a°Xb ° e wD y-i 
ox 


= A(xB(w))B(w) 


(3.11) 


^tA(t) o xb ° e wD y 


= xB(w)A(xB(w)) 


(3.12) 


ipA ° Xb ° e wD y 


= A(xB(w)) 


(3.13) 


4>Aob °Xt° e wD y 


= A(B(xw)) 


(3.14) 


d 

^—°^a°Xb° e wD y 
ow 


= A'(x(B(w))xB'(w) 


(3.15) 


ipB*(t)A'(t) °XB°e wD y 


= B*(xB(w))A'(xB(w)) 
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We can now easily get the following proposition. 

Proposition 3.2. We have 

(1) A'(B(w)) = (>a o X b o e wD Vl ) \ x=1 = ty A , o X b o e wD y ) \ x=1 , 

(2) A(B(w))B(w) = (|o^o XB o e»Vi) U=i = (V>tA(t) ° ° e- D ?/o) U=i, 

(3) A(S(iw)) = (V^a °Xb° e wD y ) \ x= i = (4>AoB °Xt° e wD y ) \x=i, and 

(4) A'(B(w))B'(w) = £((^ A o XB o e wD y ) \ x=1 ) = {ip B *(t)A'(t) ° Xb ° e™°y ) U=i- 

Proo/. All the identities are proved by setting x = 1 in (I3TS|1 . f!33|) . f!3TTU|) . flBTTT]) . (1332]i . 
f l3.13p . fl3.14p and (13. 15ft and equating the results pairwise as follows. Equations ( I3.8J1 and 
(JS3D give (1); equations f !3~TU|) and pi) give (2); equations (J3J2D and IBI3) give (3); 
and equations f)3.14p and ( 13 . 1 5|) give (4). □ 

Each of the identities in Proposition 13.21 turns out to be equivalent to the fact that a 
certain pair of operators are adjoints. In order to see this, our next task will be to put the 
procedure of setting x = 1, used in Proposition 13.21 into a context of linear functionals. 
We shall do this in the next section. 

4. Umbral connection 

We set up a bra-ket notation following |Rml] so that we may display some of the results 
obtained in the formalism there presented. 

Notation 4.1. Let f(x) = X] n >o f n%n e Then we define 

(A(v)\f(x)) = J2fnAn, 
n>0 

where the symbol (-|-) is linear in each entry. In particular, (v k /k\\x n ) = 5k, n , where 5k, n 
is the Kronecker delta. 

So we are now viewing A(v) as a linear functional on C[x}. This leads us to the notion 
of adjoint operators, a key notion in the umbral calculus as presented in [Rml]. We shall 
soon show how to recover certain of the same results about adjoints from our point of 
view. 

Definition 4.1. We say that a linear operator <fi on C[x] and a linear operator 0* on 
C[[v]\ are adjoints exactly when, for all A{v) and for all p(x), the following identity is 
satisfied: 

(P(A(v))\p(x)) = (A(v)\<f>(p(x))). 

Of course, by linearity, it is equivalent to require that the identity in Definition 14.11 be 
satisfied for p(x) ranging over a basis of C[x}. In addition, we extend the bra-ket notation 
in the obvious way to handle elements of C[x][[w]] "coefficient- wise." 
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Proposition 4.1. If <fi is a linear operator on C[x] and <fi* is a linear operator on C[[v}} 
such that 

(<f>%A(v))\e* B M) = (A(v)\<!>(e* B M)), 
for all A(v) and B(w), then and 0* are adjoints. 

Proof. Equating coefficients of w n gives us the adjoint equation for a sequence of polyno- 
mials B n (x) of degree exactly n and arbitrary A(v). Since the degree of B n (x) is n, these 
polynomials form a basis and so the result follows by linearity. □ 

Remark 4.1. The sequence of polynomials B n (x) which appeared in the proof of Propo- 
sition |4J] have been called "basic sequences" or sequences of "binomial type" (see [MRJ). 
We shall call them "attached umbral sequences" (see Definition 14.41 and Remark 14. lip . 

The next theorem allows us to translate our "set x = 1" procedure from Proposition 
13.21 into the bra-ket notation. 

Theorem 4.2. Let u G C[y ,Z/i, • • • ,x] be of the form u = J2 n>0 u n y n x n where u n G C. 
Then we have: 

(A{v)\i) e t{u)) = (Vu(u))|*=i- 

Proof. We calculate to get: 

(A(v)\ip e t(u)} = (A(v)\ J2 u n% n ) = J2 UnAn = (Mu))\ x =i- 

n>0 n>0 

□ 

We may now easily observe that parts (1) and (2) of Proposition 13.21 yield adjoint 
relationships as the following theorem formalizes. Since the proofs of each statement are 
similar to reasoning described below (regarding parts (3) and (4) of Proposition I3.2D and 
are routine, we omit them, referring the reader to [R3j and |R4] for the details. 

Theorem 4.3. We have 

(1) p(x) G C[x], viewed as a multiplication operator on C[x] and p(j-) are adjoint 
operators. 

(2) F(-^) G C[[^]] and F(v) viewed as a multiplication operator on C[[v]] are adjoint 
operators. 

□ 

Remark 4.2. Part (1) of Theorem 14.31 appeared as Theorem 2.1.10 in [Rml] and Part 
(2) of Theorem 14.31 appeared as Theorem 2.2.5 in [Rml]. 

Parts (3) and (4) of Proposition 13.21 also amount to adjoint relationships. By (13. ip and 
Theorem 14. 2\ we have that part (3) of Proposition 13.21 is essentially equivalent to 

(4.1) A(B(w)) = (A(v)\i> et o XB o e wD y ) = (A(B(v))\^ o X t ° e wD y ), 
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which in turn, by ( 13. 5p . gives 

(4.2) {A(v)\e xB{w) } = (A(B(v))\e xw ). 

We have therefore effectively calculated the adjoint to the substitution map Sb which 
acts by Sb(9{v)) = g(B{v)) for all g(v) G C[[w]]. We simply need to make a couple of 
definitions. 

Remark 4.3. As mentioned in the introduction, some proofs in this section are "ineffi- 
cient" if the results are desired merely in and of themselves. As an example of this, we 
may observe that equation (14.21) . which essentially records a classical result as mentioned 
below, is obvious once one notes that (A(v)\e xw ) = A(w). 

Remark 4.4. We shall be defining certain linear operators on C[x] by specifying, for 
instance, how they act on e xw , which, recall, stands for the formal exponential expansion. 
Of course, by this we mean that the operator acts only on the coefficients of w n n > 0. 
We have already employed similar abuses of notation with the action of in the proof 
of Proposition 13.11 and with the bra-ket notation as mentioned in the comment preceding 
Proposition 14.11 

We now recall the definition of certain "umbral operators"; cf. Section 3.4 in |Rmlj 
More particularly, the umbral operator attached to a series B{w) in this work is the same 
as the umbral operator for B(w) in |Rmlj . 

Remark 4.5. We shall attempt to always use the word "attached" in this context to 
indicate the slight discrepancy of notation from Roman's |Rmlj usage regarding the switch 
to the compositional inverse. 

Definition 4.2. We define the umbral operator attached to B(w) to be the unique linear 
map 9b '■ C[x\ — y C[x\ satisfying: 

6 B e xw = e xB{w) . 

Theorem 4.4. We have that S B and 6 B are adjoint operators. 

Proof. The result follows from Proposition 14. II and (14. 2p . □ 

Remark 4.6. Theorem 14.41 essentially appeared as Theorem 3.4.1 in |Rmlj . although in 
this work we have chosen some different characterizations of certain objects as definitions 
as discussed in the introduction. It is not difficult to tie up all the relevant information. 
For more details see [R3j . 

By (13.11) and Theorem 14.21 we have that part (4) of Proposition 13.21 is essentially equiv- 
alent to 

A'{B{w))B'{w) = ^-(A(v)\^ et o XB o e wD y ) = (B*(v)A'(v)\fa o X b ° e wD y ), 
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which in turn, by ( 13. 5p . gives 



d_ 

dw 



(A(v)\e 



X 



W) = (5*(t>)A'(f )|e x ' B(w ^) ^ 



(4.3) 



MO I 



9w 



e lB W) = (B*(v)A\v)\e xB{w) ). 



We now recall the definition of a certain important special class of "Sheffer shifts" ; cf. 
Section 3.6 in |Rmlj . More particularly, the umbral shift attached to a series B(w) in 
this work is the same as the umbral shift for B(w) in |Rml] . 

Remark 4.7. Regarding the word "attached," in this context see Remark 14.51 

Definition 4.3. For each B(w), let Db '■ C[x] — > C[x] be the unique linear map satisfying 



We call Db the umbral shift attached to B{w). 

Remark 4.8. As discussed in the Introduction and Remark 12.21 the authors of |FLM] 
were concerned with the exponential generating function of the higher derivatives of a 
composite function, because it roughly resembled "half of" a vertex operator. Following 
this analogy, we might say that ip e t o Xb ° e wD yo is an umbral analogue of ("half of") a 
vertex operator. Having made this analogy, one can see how using (I4.4p . we have defined 
"attached" umbral shifts as those operators satisfying an analogue of the L(—l) -bracket- 
derivative property, which is stated as the equality of the first and third expressions of 
formula (8.7.30) in |FLMj . See also Remark 17.51 

Theorem 4.5. We have that Db and B*{y) o are adjoints. 

Proof. The result follows from Proposition 14.11 and (14. 3p . □ 

Remark 4.9. Theorem 14.51 essentially appeared as part of Theorem 3.6.1 in |Rml] . where 
the author of that work had already, in addition, shown that the operators B*(t) o ^ are 
exactly the surjective derivations on C[[i]], a routine matter once we note that B*(t) is 
an arbitrary element of C[[£]] having a multiplicative inverse. We also mention a similar 
caveat for the reader regarding different choices of definitions between the present work 
and |Rmlj just as discussed in Remark 14.61 and in the Introduction. 

In closing this section we note obvious characterizations of the attached umbral oper- 
ators and attached umbral shifts in terms of the coefficients of their generating function 
definitions. For this it is convenient for us to recall the definition of attached umbral 
sequences; cf. Section 2.3 and Theorem 2.3.4 in particular in |Rmlj . We note that the 
umbral sequence "attached" to a series B(w) in this work is the same as the Sheffer 
sequence associated to B(w) in [Rml] . 

Remark 4.10. Regarding the use of the word "attached" in conjunction with umbral 
sequence, see Remark [4.51 



(4.4) 



D B e xB{w) 
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Definition 4.4. We define the sequence of polynomials B n (x), the umbral sequence at- 
tached to B(w), to be the unique sequence satisfying the following: 

2—* n! ^ n\ 

n>0 n>0 

Remark 4.11. We recall that the attached umbral sequences already appeared explicitly 
(though, of course not by name) in the proof of Proposition 14.11 (see Remark 14.11) . 

Proposition 4.2. We have that 9b '■ C[x] — > C[x], the umbral operator attached to B(w), 
is characterized as the unique linear map satisfying: 

9 B x n = B n {x). 

□ 

Proposition 4.3. We have that Db '■ C[x] — > C[x], the umbral shift attached to B(w), is 
characterized as the unique linear map satisfying: 

D B B n (x) = B n+1 {x). 

□ 

5. Umbral shifts revisited 

In this section we shall show a characterization of the attached umbral shifts which will 
be useful in Section [71 We begin by (for temporary convenience) generalizing Definition 

m 

Definition 5.1. For each A(t) and B(t), let D% : C[x] — > C[x] be the unique linear map 
satisfying 

D& xB{t) = §- t A(xB(t)). 
Recalling the identities (I3.6P and (|3.7p . we note that 

tpA°XB° De wD y = iPa°Xb° e wD y 1 x 1 

= ipA ° Xb ° {e wD yi) (e wD x t ) 
= A'{xB(w))xB'(w), 



so that 



A'(xB(w))xB'(w) =^j a o Xb o De wD y 

d 

= {ipA ° Xb o e wD y ) 
= Dg o ^ et o XB o e wD y . 
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Recalling that e stands for the formal exponential Taylor series, and extracting the 
coefficients in w n /n\ for n > from the second and fourth terms from the above identity 
yields: 

Db ° V>e* °Xb° D n y = ^ a o X b° D n+1 y . 

Furthermore, because ip e t o xb ° D n yo is a polynomial of degree exactly n, this formula 
characterizes the maps D^. 

Although we have briefly generalized the definition for the attached umbral shifts (in 
order to fit more closely with our calculations from Section [3]), the previous identity shows 
how it is natural to restrict our attention to the attached umbral shifts, and it is this case 
that will later interest us anyway. We may now state the characterization of the attached 
umbral shifts mentioned in the introduction to this section. 

Remark 5.1. While we only temporarily generalized the definition for attached umbral 
shifts, as the referee has pointed out, it might be nice to investigate whether extensions 
of standard umbral calculus calculations could be developed for the operators Dg. 

Proposition 5.1. The attached umbral shift, Db '■ C[x] — > C[x] is the unique linear map 
satisfying 

D B ° °Xb° D n y = ip e t o xb ° D n+1 y , 

for all n > 0. 

□ 

Remark 5.2. Proposition 15.11 was announced, together with a more direct proof, as 
Proposition 6.1 in |R2J. 

Remark 5.3. The classical umbral calculus can be considered to be the study of Sheffer 
sequences through "umbral" techniques. So far we have only considered a special case, 
attached umbral sequences. The general case may be obtained easily by using the results 
of the special case and this is probably the shortest route given the efforts we have 
already made. Alternatively one could give a complete parallel development. Recall 
that our approach began by calculating the higher derivatives of a formal composite 
function. That is, letting f(x),g(x) G C[[x]] such that the constant term of g(x) is zero, 
we began calculating the higher derivatives of f(g(x)). Let h(x) G C[[x]]. Then the 
general theory follows by the parallel argument with the starting point of calculating the 
higher derivatives of the product h(x)f(g(x)). The interested reader may see this worked 
out in detail in both |R3j and [R4J. 

6. Connection with the Virasoro algebra 

Our goal in this section will be to show that an operator closely related to the derivation 
D (which appeared in Section [3]) is one of the standard quadratic representations of the 
L[—l) operator of the Virasoro algebra. 
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Recall that we began our main investigation by calculating the higher derivatives of the 
composition of two formal power series f(x) and g(x), where the constant term of g(x) 
was required to be 0, following a proof given in |FLM] . In fact, the case that interested 
the authors in |FLM] was when f(x) = e x . It is not difficult to specialize our arguments 
to this case. When we abstract, we get the following set-up: Consider the vector space 
yC[x\, X2, X3, • • •] where Xj for j > 1 are commuting formal variables. Then let T> be the 
unique derivation on yC[xi, X2, £3, • • • ] satisfying 

(6 1) Vy = yXl 

T>Xj = x j+1 j > 1. 

The question of calculating e w ^e 9 ^ is seen to be essentially equivalent to calculating 

e wV y, 

where we "secretly" identify T> with -j^, e 9 ^ with y and with x m . It is clear by 



our identification, and rigorously as an easy corollary of Proposition 13.11 that: 



(6.2) e wV y = £ 

We note that 



n>0 



ye 



_ d d d 

v = x ^Tr + x ^ — — h ' " 

ay oxi 0x2 

d d 



dy ^ dx k 

We next shall switch gears in order to recall certain basics about the Virasoro algebra 
using operators arising from certain Heisenberg Lie algebras. We follow (a variant of) the 
exposition of this well-known material in |FLM] . Let f) be the one- dimensional abelian 
(complex) Lie algebra with basis element h. We define a nonsingular symmetric bilinear 
form on f) by [ah, bh) = ab for all a, b e C. We recall the (particular) affine Heisenberg 
Lie algebra f) which is the vector space 

i) = f) ® C[£, r 1 ] © Cc, 

with Lie brackets determined by 

[ah ®t m ,bh® t n ] = (ah, bh)mS m+n flC = abmS m+n flC, 

where c is central and 5 is the Kronecker delta. 

We may realize I) as differential and multiplication operators on a space with infinitely 
many variables as follows. We consider the space yC[xi,x 2 , x 3 , . . . ] and make the following 
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identification: 

{a(— n)x-n n < 
0{n)£- n > 
via » = o> 

where a(n), /3(n) G C for n > 1 and we identify c, the central element, with the multipli- 
cation by identity operator. Of course, in this setting y-^ is a fancy name for the identity 
operator, but we wrote it this way so that it appears explicitly as a derivation. It is easy 
to see that 



d 

a(n)x n ,/3(n)- 



-a{n)f3{n) 



dx r 

with all other pairs of operators commuting. Thus our identification gives a representation 
of the Heisenberg Lie algebra exactly when we require that for all n > 1 

a{n)f3{n) = n. 

For this representation we shall sometimes use the notation h(n) to denote the image of 
h ®f\ 

Definition 6.1. The Virasoro algebra of central charge 1 is the Lie algebra generated 
by basis elements 1, a central element, and L(n) for n G Z which satisfy the following 
relations for all m, n G Z. 

(6.3) [L(m), L(n)} = (m - n)L(m + n) + (l/12)(m 3 - m)5 m+nfi . 

Remark 6.1. The Virasoro algebra is a central extension of the Witt algebra, the Lie 
algebra of the derivations of Laurent polynomials in a single formal variable. The cor- 
respondence can be seen by identifying L(n) with —t n+1 -^- In fact, (cf. Proposition 
1.9.4 in |FLM] ) the Virasoro algebra with a general central element is the unique, up to 
isomorphism, one dimensional central extension of the Witt algebra. 

We state the following theorem without proof. It is a special case, for instance of 
Theorem 1.9.6 in |FLMj where a complete proof is provided. 

Theorem 6.1. The operators 

(6.4) L(n) = ~^2h(n- k)h(k) n^O and 

(6.5) ^o) = i5>(-|*|)fc(l*l) 

give a representation of the Virasoro algebra of central charge 1, 

The space yC[xi,x 2 , x 3 , . . . ] is obviously a module for the Virasoro algebra. It is graded 
by L(0) eigenvalues, which are called weights. In the literature, such a module is often 
called a lowest weight module; this module has y as a lowest weight vector. 
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We have 



£(0) = ^(0) 2 + h(-l)h(l) + h(-2)h(2) + ■ ■ ■ 



so that L(0)y = \y. Thus the lowest weight of the module is |. 

We may now show that by an appropriate (unique) choice of a(n) and 0(n) we get 
V = L(-l). We have 

L(-l) = h(-l)/i(0) + h{-2)h{l) + /i(-3)/i(2) + . . . 

(9 d d 

= a(l)x iy — + a(2)P(l)x 2 — + a(3)/3(2)x 3 — + .... 
oy ox i 0x2 

Therefore it is clear that in order to have D = L{— 1), we need exactly that 

at(l) = 1 and 

a(n + l)/3(n) = 1 n > 1, 

where we recall that we already have the restriction that a(n)/3(n) = n for all n > 1. 
These two sets of restrictions imply that 

(n + l)l3(n) = (3(n + l) n>l 

0(1) = 1, 

so that 

/3(n) = n! 

= (n^l)!' 

for all n > 1, is the unique solution. We record this as a proposition. 

Proposition 6.1. The operator L(—l) , given by with (and only with) both a(n) = 

( n ^ 1 ) ! an d Pin) = n\, is identical to the operator V, given by / Iff. 

□ 

For the remainder of this paper we shall assume that a{n) = ^zfy an d 0{ n ) = n - 

7. Umbral shifts revisited and generalized 

We shall continue to consider the space yC[xi, X2, ^3, ■ ■ ■ ] as in the previous section, 
and similarly to some of our previous work, such as in Section |3l we shall consider certain 
substitution maps. Let 4>B(t) denote the following algebra homomorphism. 

(j) B : yC[x 1 ,x 2 ,x 3 , ...]-» C[x] 
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with 

<pB(t)Xj = BjX J > 1 

and <j>B{t)V = 1. 

Then we have 

( j )B o e wV y = (f)B o e wi( - 1) ?/ = e xB(w) . 
In light of Proposition 15.11 it is routine to show the following. 

Theorem 7.1. The attached umbral shift, Db '■ C[x] — > C[x] is the unique linear map 
satisfying 

Db ° 4>b ° L(-l) n y = <p B ° £(-l) n+1 2/, 

/or all n > 0. 

□ 

With Theorem 17.11 as motivation, we make the following definition. 

Definition 7.1. For m > —1 we define the operators Dg(m) : C[x] —> C[x] to be the 
unique linear maps satisfying 

Pfl(m) o B o L(-l) n y = fa o L(m)L(-l)>, 

for all n > 0. 

Of course, 1) = -D^. These operators are well-defined because 4>b ° L{— l) n y has 

degree exactly n. In fact, (pB ° L(—l) n y = B n (x), the umbral polynomial attached to 
B(t). We call the operators T>b(u) generalized attached umbral shifts. 

We would like to use the Virasoro relations to help compute the generalized attached 
umbral shifts. We begin with the following lemma. 

Lemma 7.1. There exist rational numbers f m (n) such that 

L(m)L(-l) n y = f m (n)L(-l) n - m y, 
for all m > —1, n > such that n > m. 

Proof. We will need that L(m)L(—l) n y = when m > n. This is really due to the 
weights of the vectors, but in this paper we shall proceed, in just this special case, with 
an elementary induction argument. We induct on n. For n = this follows essentially 
because y is a lowest weight vector, but even without considering weights it is easy to 
directly see given the definition of the operators. By induction (used twice) we have 

L{m)L{-l) n y = Lf-ljiHlf-l)"- 1 !, + [L{m), L(-1)]L(-1)"- 1 y 

= (m + l)L{m - l^t-l)™- 1 ?/ 

= 0. 

We may now focus on the main argument. We establish the boundary cases. Letting 
m = — 1 we easily check that f-\{n) = 1. The other boundary is m = n. We shall use 
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another intermediate induction to establish this case. Our base case then is m = n = 
for which it is easy to check that /o(0) = 1/2. We also have 

L{n)L{-lfy = L(-l)L(n)L(-l) n ~ l y + [L(n), L(-l))L(-l)^ l y 

= (n + l)L(n-l)L(-l) n ~ 1 y, 

so that inducting on n we get our result. Moreover we now have the recurrence 

(7.1) /n(n) = (n + l)/ n _i(n-l) n > 1, 

with, as we have seen, the boundary /o(0) = 1/2. 

For our main argument we induct on m + n. We have already checked the base case. 
We then have by induction and using the Virasoro relations that for the remaining cases 
m > and n > m, we have 

L(m)L(-l) n y = L{-l)L{m)L{-l) n - l y + [L(m), L(-l)]L(-l)"- 1 y 

= f m (n - l)L(-l) n - m y + (m + l)L(m - l^-l^y 

= (f m (n - 1) + (m + l)/ m _i(n - 1)) L(-l) n ~ m j/. 

Therefore, not only do the values f n (m) exist but we have a recurrence for them 

(7.2) / m (n) = f m (n - 1) + (m + l)/ m _x(n - 1). 

□ 

In the last proposition we found a recurrence for certain values f m (n). We could extend 
the range of m and n to include all m > —1 and neZ and define f m (n) to be the solution 
to the recurrence equation found above which coincides when n > and n > m with the 
values already defined. In fact, it is easy to find a simpler boundary condition yielding 
the desired solution other than using the boundary with m = n. It is easy to see that 
instead we may specify that f m (0) = for m > 1, by considering fl7.ll) . which shows that 
we may specify O's below the diagonal. Further, it is easy to see from this recurrence 
equation with given boundary, that f m (n) is an integer for m^O and that fo(n) are half 
integers. It is also easy to see from this recurrence, by induction on n, that we have for 
n > that 

n 

fm(n) = f m (0) + (m + 1) ^ fm-i{n - i) 

8=1 

n-1 

(7.3) =/ ro (0) + (m+l)^/ Tn _ 1 (i). 

8=0 

We now give the natural generalization to Proposition 14.31 

Proposition 7.1. We have that Vb{itl) : C[x] — > C[x], the generalized umbral shift 
attached to B(t), is characterized as the unique linear map satisfying: 

(7.4) V B {m)B n (x) = f m (n)B n _ m (x), 
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where by convention B n (x) = for n < — 1. 

Proof. By definition 17.11 and Lemma 17.11 we have 

V B (m)B n (x) = <j) B L{m)L{-l) n y 

= f m (n)4> B L(-ir~ m y 

fm{jl)B n — m (x^. 

□ 

Remark 7.1. The convention in Proposition 17.11 that B n (x) = for n < — 1 is only used 
to ensure that in all cases the right hand side of f)7.4p is well defined. This condition could 
have allowed for much more flexibility since we already have that f m (n) = whenever 
n — m < —1. 

We shall next solve for simple closed (polynomial) expressions for / m (n) for fixed m. If 
we are willing to sum over squares, cubes etc. we could compute the answer for nonnega- 
tive n for each (fixed) m in turn, using (17. 3p . It is easy to verify that fo(n) = n + 1/2 and 
fi(n) = n 2 solve the first two cases. To solve for the remaining cases however, for variety, 
we shall use a heuristic argument making use of the Virasoro algebra relations to derive 
the answer. We have, for m + I < n, 

(I ~ m)f l+m {n)L(-l) n - l - m y = (I - m)L(l + m)L{-l) n y 

= [L(l),L(m)]L(-ir y 
= L(l)L{m)L{-l) n y - L{m)L{l)L{-l) n y 
= L(l)f m (n)L(-l) n - m y - L(m)/ i («)L(-l) ll - i 2/ 
= fi(n-m)f m (n)L(-ir- m - l y 
-Un-^f^Li-ir-^y 
= (/,(n - m)f m (n) - f m {n - J)/,(n)) L(-l) n - l - m y. 

We shall for the time being (unmathematically) ignore the restriction on the indices and 
get, for whenever all terms are well defined, the identity 

(/ - m)fi +m (n) = fi(n - m)f m (n) - f m (n- l)fi{n). 

It is easy to see that the case / = — 1 recovers 17.21 Further, one can check that the case 
I = does not yield any new information. For / = 1 we get 

(1 - m)f m +i(n) = fi(n - m)f m (n) - f m {n - l)/i(n) 
= (n - mff m (n) - n 2 f m (n - 1), 

so that it is easy to calculate, by simple substitution, each higher case (in m) starting 
with m = 2. The calculations are not difficult, of course, but I myself "cheated" and used 
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Maple to find and factor the first few answers, which yield an easy and obvious pattern 
as follows: 

/-i(n) = 1 
/o(n) = n + 1/2 
/i(n) = n 2 

/ 2 (n) = (l/2)n(n - l)(2n - 1) 

/ 3 (n)=n(n-l) 2 (n-2) 

/ 4 (n) = (l/2)n(n - l)(n - 2)(n - 3)(2n - 3) 

/ 5 (n) = n(n - l)(n - 2) 2 (n - 3)(n - 4) 

/ 6 (n) = (l/2)n(n - l)(n - 2)(n - 3)(n - 4)(n - 5)(2n - 5) 

/ 7 ( n ) = n (ri - l)(n - 2)(n - 3) 2 (n - 4)(n - 5)(n - 6) 

With that as a guide, we may now return to doing rigorous math and state and prove the 
following theorem. 

Theorem 7.2. The unique solution to the recurrence equation \1.2$ with m > — 1 and 
n G Z and wif/j boundary given by f-i(n) = 1, /o(0) = 1/2 and f m (0) = /or m > 1 is 
aif en 6y 

/-i(n) = 1 

/ m (n) = (l/2)(n(n-l)(n-2)---(n-m + l))(2n-m + l) /or m > 0. 

Proof. The proof is a straightforward calculation. Let m > (although admittedly the 
low m cases are a bit degenerate in this notation). Then we have 

f m {n) - f m {n - 1) = (l/2)(n(n - l)(n - 2) • • • (n - m + l))(2n - m + 1) 

- (l/2)(n - l)(n - 2) • • • (n - m)(2(n - 1) - m + 1) 

= (1/2) (n- l)(n -2) •• -(n-m+ 1)- 
• (n(2n — m + 1) — (n — m)(2(n — 1) — m + 1)) 

= (1/2) (n — l)(n — 2) • • • (n — m+ l)(2mn + 2n — m 2 — m) 

= (n — l)(n — 2) ■ ■ ■ (n — m + l)(m + l)(n — m/2) 

= (m + l)/ m -i(n - 1). 

□ 

Remark 7.2. Recalling (17.31) . it is easy to see that we could use the last result, perhaps 
somewhat awkwardly, to solve for the sum of squares and cubes etc., which happens to 
be related to the Bernoulli numbers, one of the motivating subjects for Blissard |Bli] and 
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is one of the classic problems solved via umbral methods (cf. Chapter 11 |Do] for a nice, 
succinct old-fashioned umbral style proof and also Chapter 3.11 |Mel] ) . 

Remark 7.3. We note that the umbral calculus has long been known to have connections 
to the Bernoulli numbers and polynomials (see e.g. |Mel] ) . Bernoulli polynomials have 
also appeared in the literature of vertex algebra theory (see e.g. |L1] and [DLMj ). Just 
as we have been establishing some analogues and connections between umbral calculus 
and vertex algebra theory, it might be interesting in future work to investigate further 
possible connections explicitly related to Bernoulli numbers and polynomials. 

Remark 7.4. We used a heuristic argument emphasizing the connection between umbral 
calculus and the Virasoro algebra to help guess a solution to f m (n) leading up to Theorem 
17.21 However, as the referee has pointed out, one can connect the result with classical 
umbral calculus as well. The calculations below essentially reproduce results obtained and 
shown to me by the referee and we shall follow their reasoning. Any errors or inelegance 
etc. in the particular exposition presented here are entirely due to me. Observing in all 
that follows the typical caveats about the degeneracy of cases for "low values of n", for 
n > let 



Assuming we know that t n (x) are polynomials, then using ( 17. 2p for n > 1 we get 

t n (x) = t n (x - 1) + nt n -i(x), 

since the result holds for all positive integral values of x. Using the formal Taylor theorem 
(12. 3p . we get for n > 1 that 



Therefore t n (x) is a Sheffer sequence (cf. Theorem 2.3.7 [Rml] ). It is easy to see (cf. 
Theorem 2.4.3 in |Rmlj ) that the relevant associated sequence Pk{y) satisfies 



(7.5) 



t n (x) = f n -i{x + n). 




t n (x - 1) + nt n -i(x) - t n -t(x - 1) 
nt n _i(x). 




which yields 



Pk(y) =y(y + i)---(y + k-i). 



We further have (cf. Theorem 2.3.9 [Rml] ) 




fc=0 
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It is easy to see that to(— 2) = 1, ti(— 2) = —1/2 and that t n (—2) = for n > 1 so that 
for n > 

t n {x) = + 2 )*n-fc(- 2 ) 

fc=0 ^ ' 

= p n (x + 2) - -p n -i(x + 2) 

= (x + 2){x + 3) • • • (x + n + 1) - -(x + 2)(x + 3)---(x + n) 
(7.6) = ~(x + 2){x + 3) • • • (x + n)(2x + n + 2). 

It is easy to check this formula using ( 17.51) and Theorem 17.21 The referee also pointed out 
that letting 

/ \ t n (x) ( / n _i(x + n)\ 

Sn{X) = — j- = : , 

n! \ n\ J 

it is easy to see that the following particularly simple recursion holds 

Snip) = Sn-l(x) + S n (x - 1), 

and, continuing to follow the referee, using (17. 6p it is easy to see that 

. . fx + n + l\ 1 fx + n\ 
Sn{x)= { n J " 2 U " J • 

We shall conclude this paper by stating and proving the natural generalization to the 
original formula defining the attached umbral shifts in Definition 14.31 

Proposition 7.2. For each m > —1, the map Vb{iti) : C[x] — > C[x] is the unique linear 
map satisfying: 



V B (m)e 



xB(w) 



W 



m+1 



d m + 1 



dw 



+ 



-w 
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Proof. We calculate, for m > — 1 (although once again the low m cases are a bit degen- 
erate) to get 

m+l d xB(w) _ ,„m+l _^ B„(x)w r 



<9w; <9w ^— ' n! 

n>0 



yv nB„(.r )//'"+'" 



n 

n>0 



\ - (re - m)B n _ m (x)w r 
' (re — m)! 

n>m 



n(n — 1) • • • (n — m)B n _ m (x)w n 



and 



w m e xB(w) = B n (x)w n+m 
^ re! 

n>0 

_ x - B n - m (x)w n 
^— ' (n — m)! 

= £~ 

so that it is easy to check that 



n>m 

re(re — 1) • • • (n — m + 1)-B n _ m (a;)w n 
n! 



(w^ 1 — + m + 1 ^ e xB(») = >p frn(n)B n ^ m (x)w T - 

\ dw 2 J ^ re! 

= V B (m)e xB{w) . 



□ 



Remark 7.5. Building on Remarks 12.21 and 14.81 we may regard Proposition 17.21 as an 
analogue of formula (8.7.37) in |FLM| in the cases where the re in (8.7.37) in |FLM] is 
restricted so that re > —1. The h in formula (8.7.37) in |FLM] should be replaced by 
the weight of the relevant lowest weight vector, which in our setting seems perhaps to 
correspond with the lowest weight of the module of the Virasoro algebra which we have 
been considering, which as we have noted is indeed 1/2. 
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